Rough Set Based Unsupervised Feature Selection Using Relative dependency Measures
نویسندگان
چکیده
Feature Selection (FS) is a process which attempts to select features which are more informative. It is an important step in knowledge discovery from data. Conventional supervised FS methods evaluate various feature subsets using an evaluation function or metric to select only those features which are related to the decision classes of the data under consideration. However, for many data mining applications, decision class labels are often unknown or incomplete, thus indicating the significance of unsupervised feature selection. However, in unsupervised learning, decision class labels are not provided. The problem is that not all features are important. Some of the features may be redundant, and others may be irrelevant and noisy. In this paper, we propose a new rough set-based unsupervised feature selection using relative dependency measures. The method employs a backward elimination-type search to remove features from the complete set of original features. As with the classification performance is evaluated using WEKA tool. The method is compared with an existing supervised method and demonstrates that it can effectively remove redundant features.
منابع مشابه
Fuzzy rough sets, and a granular neural network for unsupervised feature selection
A granular neural network for identifying salient features of data, based on the concepts of fuzzy set and a newly defined fuzzy rough set, is proposed. The formation of the network mainly involves an input vector, initial connection weights and a target value. Each feature of the data is normalized between 0 and 1 and used to develop granulation structures by a user defined α-value. The input ...
متن کاملA hybrid filter-based feature selection method via hesitant fuzzy and rough sets concepts
High dimensional microarray datasets are difficult to classify since they have many features with small number ofinstances and imbalanced distribution of classes. This paper proposes a filter-based feature selection method to improvethe classification performance of microarray datasets by selecting the significant features. Combining the concepts ofrough sets, weighted rough set, fuzzy rough se...
متن کاملComparing Fuzzy-Rough and Fuzzy Entropy-assisted Fuzzy-Rough Feature Selection
Feature Selection (FS) methods based on fuzzy-rough set theory (FRFS) have employed the dependency function to guide the FS process with much success. More recently a method has been developed which uses fuzzy-entropy [9] to perform this task. Such use of fuzzy-entropy as an evaluation measure in fuzzy-rough feature selection can result in smaller subset sizes than those obtained through FRFS a...
متن کاملFeature Selection Based on Relative Attribute Dependency: An Experimental Study
Most existing rough set-based feature selection algorithms suffer from intensive computation of either discernibility functions or positive regions to find attribute reduct. In this paper, we develop a new computation model based on relative attribute dependency defined as the proportion of the projection of the decision table on a condition attributes subset to the projection of the decision t...
متن کاملFeature Selection for Traditional Malay Musical Instrument Sound Classification Using Rough Set
With the growing volume of data and feature (attribute) schemes, feature selection has become a very vital aspect in many data mining tasks including musical instrument sounds classification problem. The purpose of feature selection is to alleviate the effect of the „curse of dimensionality‟. This problem normally deals with the irrelevant and redundant features. Using the whole set of features...
متن کامل